Workshop AIL Active and Incremental Learning

نویسندگان

  • Vincent Lemaire
  • Jean-Charles Lamirel
  • Pascal Cuxac
  • Tomas Arredondo
  • Alexis Bondu
  • Fabrice Clérot
  • José García-Rodríguez
چکیده

Most machine learning techniques assume, either explicitly or implicitly, that the data-generating process is stationary. This assumption guarantees that the model learnt during the initial training phase remains valid over time and that its performance is in line with our expectations. Unfortunately, this assumption does not truly hold in the real world representing, in many cases, a simplistic approximation of the reality. The talk will describe the Just-In-Time (JIT) approach that is a flexible tool implementing the detection/adaptation paradigm to cope with evolving processes. Solutions following this approach improve the knowledge about the model in stationary conditions by exploiting additional information coming from the field during the operational life. Differently, in nonstationary conditions, as soon as a change in the data-generating process is detected, the learnt model is discarded and a suitable one activated to keep the performance. As a valuable and challenging application of the proposed approach, JIT classifiers for concept drift will be detailed and discussed. Incremental Decision Tree based on order statistics Christophe Salperwyck1 and Vincent Lemaire2 Abstract. New application domains generate data which are not persistent anymore but volatile: network management, web profile modeling... These data arrive quickly, massively and are visible just once. Thus they necessarily have to be learnt according to their arrival orders. For classification problems online decision trees are known to perform well and are widely used on streaming data. In this paper, we propose a new decision tree method based on order statistics. The construction of an online tree usually needs summaries in the leaves. Our solution uses bounded error quantiles summaries. A robust and performing discretization or grouping method uses these summaries to provide, at the same time, a criterion to find the best split and better density estimations. This estimation is then used to build a naı̈ve Bayes classifier in the leaves to improve the prediction in the early learning stage. New application domains generate data which are not persistent anymore but volatile: network management, web profile modeling... These data arrive quickly, massively and are visible just once. Thus they necessarily have to be learnt according to their arrival orders. For classification problems online decision trees are known to perform well and are widely used on streaming data. In this paper, we propose a new decision tree method based on order statistics. The construction of an online tree usually needs summaries in the leaves. Our solution uses bounded error quantiles summaries. A robust and performing discretization or grouping method uses these summaries to provide, at the same time, a criterion to find the best split and better density estimations. This estimation is then used to build a naı̈ve Bayes classifier in the leaves to improve the prediction in the early learning stage.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Comparing the Efficiency of Electronic Learning and Workshop Learning on Knowledge and Performance of Nursing Students in Controlling Nosocomial Infections

Background Being familiar with new teaching methods and comparing their result helps teachers achieve better planning for applying such methods in the future. This study is aimed on comparing the efficiency of electronic learning and workshop on knowledge and performance of nursing students in controlling nosocomial infections. Methods Two groups were selected by pre-test post-test method. Stud...

متن کامل

A Hybrid Framework for Building an Efficient Incremental Intrusion Detection System

In this paper, a boosting-based incremental hybrid intrusion detection system is introduced. This system combines incremental misuse detection and incremental anomaly detection. We use boosting ensemble of weak classifiers to implement misuse intrusion detection system. It can identify new classes types of intrusions that do not exist in the training dataset for incremental misuse detection. As...

متن کامل

The Effect of a Learning and Study Skills Workshop on Talented Students’ Learning and Study Strategies in Isfahan University of Medical Sciences

Introduction: The significance of learning strategies used by students in their academic achievement has been argued. In this study, the effect of a study skills educational workshop on employing learning and study strategies by talented students of Isfahan University of Medical Sciences has been investigated. Methods: This quasi-experimental study was performed on 40 talented students of Isfa...

متن کامل

Presenting the Course of Community Dentistry as Problem Based Learning Workshop and Comparing It to Learning through Lecture

Introduction: Nowadays, the attitudinal aspect of community dentistry course and the way in which this course is taught is critically important. The aim of this study is to compare the presentation of theoretical community dentistry course through workshops using PBL method with that of contemporary method of lecture. Methods: In this quasi experimental study, 64 dental students of Tehran Univ...

متن کامل

Comparing the Training of the Mass Casualty Incident Triage by the Virtual Workshop and Non-virtual Workshop (Face-to-Face) on the Nursing Students' Learning and Satisfaction

Introduction: Workshop training is one of the most effective learning methods. This study aims to compare the effect of virtual and face-to-face workshop training on students' learning and satisfaction. Methods: A three-group intervention study with before and after measurements was conducted with the participation of 106 nursing students. A convenient method of assigning replacement blocks wa...

متن کامل

Ac 2010-1036: Workshop on Designing Active Learning Activities and Associated Assessment Plans

Although the pedagogical advantages of active learning are apparent from the literature, the use of these techniques is not yet pervasive in the engineering curricula. This is due, in part, to the lack of a smooth implementation path. A major roadblock to implementation of active learning techniques is the lack of "ready-to-use" active learning products (ALPs) and procedures. To remedy this, ov...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2012